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Abstract 

Sensitivity to luminance difference, or contrast sensitivity, is critical for animals to survive in and interact with the external 
world. The contrast sensitivity function (CSF), which measures visual sensitivity to spatial patterns over a wide range of 
spatial frequencies, provides a comprehensive characterization of the visual system. Despite its popularity and significance 
in both basic research and clinical practice, it hasn't been clear what determines the CSF and how the factors underlying the 
CSF change in different conditions. In the current study, we applied the external noise method and perceptual template 
model to a wide range of external noise and spatial frequency (SF) conditions, and evaluated how the various sources of 
observer inefficiency changed with SF and determined the limiting factors underlying the CSF. We found that only internal 
additive noise and template gain changed significantly with SF, while the transducer non-linearity and coefficient for 
multiplicative noise were constant. The 1 2-parameter model provided a very good account of all the data in the 200 tested 
conditions (86.5%, 86.2%, 89.5%, and 96.4% for the four subjects, respectively). Our results suggest a re-consideration of the 
popular spatial vision model that employs the CSF as the front-end filter and constant internal additive noise across spatial 
frequencies. The study will also be of interest to scientists and clinicians engaged in characterizing spatial vision deficits 
and/or developing rehabilitation methods to restore spatial vision in clinical populations. 
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Introduction 

Sensitivity to luminance difference, or contrast sensitivity, is 
critical for animals to survive in and interact with the external 
world, e.g. searching for food, locating water source, mating, 
socializing, and disclosing predators from its surrounding camou- 
flages [1]. The CSF, which measures visual sensitivity to spatial 
patterns over a wide range of spatial frequencies, provides a 
comprehensive characterization of the visual system [2-6]. A 
typical CSF resembles a band-pass filter that peaks at intermediate 
frequencies (usually 2-6 c/deg) and drops off at both lower and 
higher spatial frequencies [7-12]. The CSF has been found to 
differ greatiy among species [13], improve with development [14], 
deteriorate with aging [15], and vary with attentional state 
[16,17], luminance level [18], adaptation [19], and a variety of 
visual diseases, including amblyopia [20-22], glaucoma [23,24], 
dyslexia [25], and major depressive disorder [26,27]. Moreover, 
the pattern of CSF changes may vary with clinical conditions. For 
example, amblyopia is largely regarded as a high SF deficit [28,29] 
and dyslexia as a condition with low SF deficits ([30,31], but see 
[32]). 

Models of spatial vision that employ the CSF as the front-end 
SF filter have been developed to account for human performance 
in a wide range of visual tasks, including letter identification 



[33,34] and face recognition [35], implicitly treating the CSF as 
the gain profile of the visual system in the spatial frequency space 
[36]. Although these models provided good accounts of human 
performance in many tasks, equating the gain profile of the 
channels according to the CSF of the observer may be 
problematic. With a contrast matching paradigm, Georgeson 
and Sullivan [36] have shown that the perceived contrast of 
gratings is largely independent of the testing spatial frequencies in 
high contrast conditions, a finding indicative of almost equal gain 
across different spatial frequencies in the visual system. Several 
other studies have also found that adding high magnitude of 
external noise to the to-be-detected gratings can flatten the CSF, 
suggesting that the CSF is not a simple function of the gain of the 
visual system [20,37—40]. In addition, there are also many known 
nonlinearities, including nonlinear transducer function and 
multiplicative noise in the visual system [41-43], which wiU 
inevitably perplex the interpretation of previous data about CSF. 
The spatial vision models that consist of CSF as the front-end filter 
and constant internal additive noise across spatial frequencies may 
account for human performance in certain conditions but not in 
more extended conditions that include a wide range of stimulus 
contrasts, external noise, and performance levels [44]. 
Those models need to be elaborated by fuUy specifying the gain. 
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non-linearity, and noises in the visual system [45]. In the present 
paper, we attempt to directly determine the perceptual limitations 
underlying the contrast sensitivity fimctions by combining the 
external noise method and observer modeling. The result will 
allow us to test the traditional multi-channel model of spatial vision 
and provide new insights for understanding CSF deficits in clinical 
conditions. 

Applications of the external noise method and observer models 
to identify the intrinsic limitations of human observers date back to 
the 1950's [46]. The external noise method measures the threshold 
versus external noise contrast (TvC) function to estimate how 
much signal contrast or feature difference is required for an 
observer to maintain a certain performance level as a function of 
external noise [41,43,46,47]. The TvC function is usually fitted 
with a specific observer model such as the linear amplifier model 
(LAM) [48], the induced noise model [49], the linear amplifier 
model with decision uncertainty [43], the induced noise and 
uncertainty model [42], or the perceptual template model [41]. 
The perceptual template model (PTM) incorporates and integrates 
the major components of the previous observer models and has 
been shown to provide an excellent account of a range of 
psychophysical data [45]. 

To date, the external noise paradigm and observer models have 
been usually implemented in a single SF condition. In the current 
study, we apply the external noise method to characterize the 
limiting factors underlying contrast sensitivity at different spatial 
frequencies [41,45]. The PTM is used to decompose contrast 
sensitivity into four intrinsic hmitations of the perceptual system 
[45]: (1) the gain of the perceptual template, (2) a nonlinear 
transducer, (3) internal multiplicative noise whose amplitude is 
directly related to the total amount of input stimulus energy, and 
(4) internal additive noise whose amplitude is invariant with input 
stimulus. Characterizing the four factors as functions of SF allows 
us to identify the perceptual limitations underlying the CSF. 

Methods 

Subjects 

Four novice adult observers, two males and two females, aged 
21-27 years (24.25±2.50; mean ± s.d.), with normal or corrected- 
to-normal vision, participated in the study. All studies were 
performed with written informed consent of the subjects and 
approved by the research ethics committee of Institute of 



Psychology, CAS and followed the tenets of the Declaration of 
Helsinki. 

Apparatus and Stimuli 

All stimuli were generated by a PC running Matlab and 
PsychToolBox extensions [50,5 1] and presented on a luminance- 
linearized Sony G520 monitor [52] with a vertical refresh rate of 
85Hz, a resolution of 1600x1200 pixels and mean luminance of 
30.6 cd/m^. A special circuit was used to combine two 8-bit output 
channels of the video card to produce 14 bits of gray levels [52]. 

Signal stimuli were vertical sinusoidal gratings at five spatial 
frequencies (SF: 0.5, 1, 2, 4, and 8 c/deg), presented binocularly in 
the center of the display. Each grating consisted of three cycles. 
The size of the gratings was inversely proportional to their spatial 
frequencies (i.e., 576, 288, 144, 72, and 36 pixels), subtending 6, 3, 
1.5, 0.75, and 0.375 degrees of visual angle at a viewing distance of 
138.5 cm. External noise images, constructed from Gaussian 
distributed pixel intensities with eight standard deviations (0, 0.01, 
0.02, 0.04, 0.08, 0.16, 0.24, and 0.32), were combined with signal 
gratings through temporal integration (Figure 1. A). The size of 
the signal grating and external noise images was the same. The 
size of the noise elements was scaled with the signal grating size to 
maintain 1 8 noise elements per image so that the spectra of signal 
gratings and the external noise maintained a constant relationship 
across different spatial frequency conditions (Figure 1). 

Design 

In the main experiment, we adopted the method of constant 
stimuli (MCS) to obtain psychometric functions (PFs) in all forty 
conditions (5 SFs x8 external noise levels). Each PF was sampled at 
five signal contrasts with 90 trials per sample. There were 200 
conditions (5 SFs x8 external noise levels x5 signal contrasts), and a 
total of 18,000 trials per observer. 

Each observer ran 1 5 sessions of 1 200 trials each. A mini-block 
design was used: the 1 200 trials were divided into 6 blocks, each of 
which consisted of five mini-blocks of 40 trials (8 external noise 
levels X 5 signal contrast x 5 repetitions) in a single SF condition. AU 
five SF conditions were tested in each block. External noise and 
signal contrast conditions in each mini-block were randomized. 
The order of SF was randomized in each block. Observers could 
volunteer for a break after every 40 trials. A mandatory one- 
minute break was provided every 80 trials. Each session took 
approximately 75 minutes. 
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Figure 1. Stimuli samples and the spectra of external noise. (A) Noise masl<ed gratings in different spatial frequency conditions (Ng„ = 032). 
(B) The magnitude spectra of external noise in different spatial frequency conditions. Note that different ranges are used for the x-axis in plotting the 
magnitude spectra in different spatial frequency conditions. 
doi:1 0.1 371/journal.pone.0090579.g001 
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Figure 2. The procedure of one trial. Each trial consisted of two intervals separated by 505.9 ms. Each interval started with a 200 ms fixation, and 
followed by two external noise frames, a grating or blank frame, two additional external noise frames. Each frame lasted 35.3ms. 
doi:1 0.1 371 /journal.pone.0090579.g002 



To ensure efficient sampling of the PFs, a Bayesian procedure 
was first utilized to quickly estimate contrast thresholds at 70.7% 
and 79.4% correct in detecting gratings of the five spatial 
frequencies in the zero and highest external noise conditions in 
a pilot study [61]. Rough estimates of the contrast thresholds in the 
remaining noise conditions were linearly interpolated from these 
thresholds. For a given SF and external noise level, we sampled the 
PF at five signal contrast levels using an efficient sampling method 
[53]. 



Procedure 

Subjects performed a two-interval forced-choice (2-IFC) grating 
detection task in both the main and pilot experiments. Figure 2 
depicts a typical trial. Each trial started with a 200-ms frxation 
cross in the center of the display, followed by two stimulus 
presentation intervals, each demarcated by a brief tone and 
consisted of a sequence of 35.3-ms frames: two external noise 
frames, one signal or blank frame, and two additional external 
noise frames. The two intervals were separated by a 505.9-ms 



SF = 0.5 SF = 1 SF = 2 SF = 4 SF = 8 (c/deg) 
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Figure 3. Contrast psychometric functions for the four observers. Each row contains contrast psychometric functions for one observer, with 
increasing spatial frequency from left to right columns. Different colors indicate different external noise conditions. The solid line represents the best- 
fitting model in which the contrast psychometric functions shared the same slope across different spatial frequencies and external noise levels. See 
details in Table 1. 

doi:1 0.1 371 /journal.pone.0090579.g003 
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inter-stimulus interval. Signal appeared in one of the two intervals 
with equal probability. Subjects were asked to report which 
interval contained the signal by a keypress. All external noise 
frames were sampled independendy. No feedback was given. 



Results 
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Psychometric Functions 

Figure 3 shows the PFs in all forty conditions. For each 
observer, the PFs shifted to the right as external noise increased in 
low to medium spatial frequencies, and tended to collapse in high 
spatial frequencies. The pattern of results suggested that contrast 
threshold increased with external noise, and the rate of increase 
was slower in high spatial frequencies. 

We evaluated the relationship between the slopes and thresholds 
of the forty PFs for each observer through model comparison. 
Each PF was fitted with a WeibuU function: 
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where is percent correct in each condition, cc is the lapsing rate 
and set to be 0.02, fi is the guessing rate of 0.5, t] is the slope of the 
PF, and T is the contrast threshold at 80.3% performance level. We 
tested four different models: (1) the fuU model: aU parameters are 
different for the 40 PFs, leading to a total of 80 parameters (40 
slopes and 40 thresholds); (2) reduced model 1 : the slopes are the 
same but the thresholds are different across the PFs; (3) reduced 
model 2: the thresholds are the same but the slopes are different 
across the PFs; (4) the most reduced model: the slopes and 
thresholds are the same across the PFs. The four models were 
fitted to the PFs with a maximum likelihood procedure, gauged by 
an statistics, and compared with Chi-square test for nested 
models (Eq. 2-4). 
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(4) 



where i denotes a single condition, jV; and are the numbers of 
the total and correct trials, P, is the percent correct derived from 
the WeibuU function; Ppred and Pnieas are the predicted and 
measured percent correct, respectively; Likelihoodf^u and Like- 
lihood„duced are the maximum likelihood of the full and reduced 
models, respectively, dfh equal to the difference of the number of 
parameters of the models. 

The model with the same slope but different thresholds across 
all spatial frequencies and external noise levels accounted for 
89.6%, 90.2%, 91.9%, and 95.8% of the variance for the four 
observers, respectively. Model comparison revealed that this 
model was statistically comparable to the full model (both slope 
and threshold vary; see Table 1) and superior to the most reduced 
model (neither slope nor threshold varies; see Table 1). The slope 
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Figure 4. Contrast sensitivity functions in different external noise conditions. Different colors indicate different external noise conditions. 
doi:1 0.1 371 /journal.pone.0090579.g004 



estimated from the model was 2.15, 3.12, 2.83, and 3.02 
(2.78±0.44) for the four observers, respectively (see Table 2). 
The values were comparable to estimates reported in the literature 
[43,48,54]. 

Contrast Sensitivity Functions 

CSFs corresponding to 80.3% correct performance that were 
derived from the best fitting results are plotted in Figure 4 in eight 
external noise conditions for each observer (also see Table 2). In 
zero external noise, the CSFs showed a typical band-pass or low- 
pass profile. As external noise increased, contrast sensitivity 
decreased, and the CSFs became increasingly flat. Averaged over 



observers and spatial frequencies, contrast sensitivity decreased 
from 47.27 to 3.80, a reduction of 91.8%, as external noise 
increased from 0 to 0.32. Contrast sensitivity varied significantiy 
with SF {F (4, 12)= 169.11, /<= 1.93x10"'"), external noise 
levels {F{7, 21) = 163.02, ^ = 7.54x 10""), and their interaction 
(F{28, 84) = 27. 14,/) =3. 16x10""'). 

PTM Model Analysis 

The perceptual template model described performance accura- 
cy ((f) as a function of signal and external noise contrasts: 



Table 3. Paranneters for the best fitting perceptual template model (PTM). 
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0 ^ where c is the signal contrast, p is the gain of the perceptual 
~ ■- template(relative to its gain to external noise), y characterizes 
OJ s system's non-hnearity, M^jj is the standard deviation of the internal 
Q. ^ additive noise, jV„.; is the standard deviation of external noise, M„ui 

1 g is the proportional constant for multiplicative noise. In the PTM, 

both signal and external noise are processed through the 

c J perceptual system; perceptual decisions are made based on the 



inputs to the decision unit, without knowing what signal and what 
noise is. Eq. 5 describes the input-output relationship in the PTM. 
It does not imply that signal and external noise are known 
independendy by the PTM. Performance accuracy (d) can also be 
expressed as: 

d' = 2xz(P,), (6) 
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to -g where z(Pc) is the z-score of the 2-IFC percent correct. Combing 

g S Eq.5 and 6, we can fit the PTM to the measured PFs and evaluate 

g how the PTM parameters vary across spatial frequencies 

(Table 3). We also constructed a model lattice ranging from a 
model with a single set of parameters across all SF conditions to a 
S 3 model with every single parameter varying with SF. 

.t The best fitted model which allowed both JV„jj and B to vary 

I with SF provided comparable accounts (r =86.5%, 86.2%, 
89.5%, and 96.4% for the four observers, respectively) of the 
variance of the data with the full model that allowed all four 
parameters free to vary, and was superior to all its reduced 
versions, i. e., the model with only Majj free to vary with SF, the 
model with only /? free to vary with SF, and the model with the 

1 _2 same parameters across SFs (see details in Table 4). In other 
ai :~ words, the CSFs measured in a wide range of external noise 
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^ J. multiplicative noise coefficient. 

^ -n Parameters of the best fitting PTM are listed in Table 4. In 
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interesting to note that internal additive noise changed more 
I pronouncedly than the relative template gain. As SF increased 
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Figure 5. Results of perceptual template model analysis and contrast sensitivity functions. Column A: contrast sensitivity functions at 
zero (circle) and the highest external noise (triangle) levels are plotted as functions of spatial frequency; at zero external noise, the contrast sensitivity 
functions show a typical band-pass or low-pass profile. At high external noise, the contrast sensitivity functions have much lower amplitudes and 
don't vary much with spatial frequency. Column B, C, D and E: Nadd, P, Nmui, and y as functions of spatial frequency, respectively. 
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Discussion 

By titrating the visual system with external noise and analyzing 
signal contrast threshold versus external noise functions with the 
perceptual template model, we demonstrated in the current study 
that the shape of CSF measured from 0.5 to 8 c/ deg and in zero 
external noise is mainly a reflection of how internal additive noise 
changes with SF. Although template gain also changed signifi- 
cantly with spatial frequency, the magnitude of variation was very 



limited (less than a factor of 2). On the other hand, multiplicative 
noise and the nonlinear transducer remained constant with SF. 

As the external noise increased from 0 to 0.32, the averaged 
contrast sensitivity decreased from 47.27 to 3.80, and the CSF 
became flatter (Figure 4). The finding that contrast sensitivity is 
largely independent of SF at high noise conditions is consistent 
with previous findings [39,55,56]. Our analysis suggested that the 
internal additive noise changed more profoundly with SF than the 
gain of the perceptual template did, consistent with the results 
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Figure 6. N^m ^nd /'as functions of contrast sensitivity under zero and the highest noise conditions. The filled symbols represent the 
results of zero noise condition; the open symbols refer to the results of highest noise condition. Data from the four observers in the five spatial 
frequency conditions are pooled. CS_zero: contrast sensitivity in zero noise level {Ng„ = 0.00); CS_high: contrast sensitivity in highest noise level 
(Ne„ = 0.32). 
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Figure 7. Modulation transfer function (MTF) and results of the best fitting MTF-incorporated PTM. The first four columns are N^dd, P as 

functions of SF for the four subjects, respectively, after taking the MTF of the eye into account. The gains of the eye to the signal and external noise 
stimuli in the five SF conditions are plotted in the last column. 
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found by Banks et al. [11], McAnany et al. [37], Rovamo et al. 
[39], and Tjan et al. [38] that CSF might be related with internal 
noise, and also consistent with Oruc et al. [40] and Kersten et al. 
[12] that external noise flattened CSFs. Our results suggest that 
model that employs CSF as the front-end SF filter to their inputs 
and a constant additive internal noise in all SF channels [33,56,57] 
might be incorrect and need to be revised. Although those models 
can explain the shape of the CSF in zero external noise, only the 
model presented in this paper can account for CSFs in a wide 
range of external noise conditions. 

All the model parameters of the PTM are specified relative to 
the input signal and external noise stimuli presented to the 
subjects. We can further separate the contributions from the optics 
of the eye and subsequent stages of visual processing. Following 
the suggestion of an anonymous reviewer, we performed some 
additional analysis on our stimuli, taking into account of the 
modulation transfer function (MTF) of the eye. We filtered the 
signal and external noise images used in our experiment with a 
typical MTF for fovea detection [58] and a one-octave band-pass 
filter. The band-pass filter was used to model channel properties of 
the visual system [8] . We calculated the gain of the model to the 
signal grating and external noise in different center spatial 
frequency conditions. The MTF and the band-pass filter were: 

MTF(f) = (\-C)exp(-Axf) + Cx exp(-Bxf), (7) 



(GsPcf 



BPF(f)= exp 



log2(/')-log2(/o)' 



(8) 



where /is the spatial frequency in cycles per degree. A, B, and C 
are 0.172, 0.037, and 0.22, respectively [58];/) is the center spatial 
frequency, a — 0.6 is the standard deviation of the band-pass filter 
with a half-height full-bandwidth of one octave. 

The gain to the signal and external noise stimuli both decreased 
with central spatial frequency: the gains to signal stimuli with 
central frequencies of 0.5, 1, 2, 4, and 8 c/deg were 0.506, 0.469, 
0.404, 0.308, and 0.193, respectively; the gains to the external 
noise were 0.139, 0.130, 0.1 14, 0.089, and 0.057, respectively; the 
ratios between the gains to signal and external noise were: 3.640, 
3.608, 3.544, 3.461, and 3.386. Denoting G, and G„ as the gain to 
the signal and external noise stimuli, we can incorporate the MTF 
into the PTM model: 



(9) 



We fitted the MTF-incorporated PTM to the data. As shown in 
Figure 7, model comparison revealed that the best model is stiU 
the one with both internal additive noise and template gain 
changing with spatial frequency but constant non-linear transduc- 
er and multiplicative noise coefficient. Averaged over four 
observers, internal additive noise increased by 2.77±0.66 log 
units (mean ± s.d.) and template gain increased by 136.4±38.9% 
as spatial frequency increased from 0.5 to 8 c/deg. The 
multiplicative noise coefficient and nonlinearity was constant 
across spatial frequencies. The contrast sensitivity function in the 
zero external noise condition was correlated with both internal 
additive noise [r= —0.94, /;= 1.78x10 ) and template gain [r = 
— 0.84, /) = 4.24x10 **); in the highest external noise condition, 
neither the correlation between internal additive noise and CSF 
(r= — 0.05, /) = 0.82) nor the correlation between template gain 
and CSF ()-=0.29, p = 0.22) was significant. This additional 
analysis allowed us to separate contributions of the MTF of the 
eye and the rest of the visual system to the CSF. 

In the current study, it took 18,000 trials to measure observer 
performance in 200 combinations of spatial frequency, external 
noise, and signal contrast using the method of constant stimuli 
(MCS). Although MCS provided fuU constraints of our model, die 
inefficient test procedure makes it impossible to apply our 
approach in clinical practice. Fortunately, we found that the slope 
of contrast psychometric functions was invariant across spatial 
frequencies and external noise levels. The finding is consistent with 
many previous observations [41,43,48,54,59] and supports the 
"homogeneity assumption" of slope for all pattern-detecting 
mechanisms [60]. The slope invariance provides an important 
regularity for us to exploit in developing quick adaptive methods to 
measure CSFs in different spatial frequency and external noise 
conditions [61]. 

Our approach may be applied to understand contrast sensitivity 
deficits in clinical populations. CSF deficits have been reported in 
amblyopia, dyslexia, glaucoma, and major depressive disorder. It 
will be very interesting to fully characterize the internal noise and 
spatial frequency gain profiles of the observers in those popula- 
tions. Several studies have found that perceptual learning can 
significandy improve contrast sensitivity [22,62-64]. It will also be 
interesting to obtain a snapshot of the visual system using the 
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methods developed in this study before and after training to 
characterize the underlying mechanisms of the improvements. 
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